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A METHOD OF ESTIMATING AND CORRECTING CAMERA 
ROTATION WITH VANISHING POINT LOCATION 

CROSS REFERENCE TO RELATED APPLICATIONS 
5 Reference is made to U. S. Provisional Serial No. 60/192,400, filed 

March 27, 2000, entitled Correction of Small Image Rotations Based on 
Vanishing Point Detection by Andrew C. Gallagher. 

FIELD OF THE INVENTION 
10 The invention relates generally to the field of image processing. 

More specifically, the invention relates to estimating and correcting for 
unintentional rotational camera angles that occur at the time of image capture, 
based upon the captured image's corresponding location of its vanishing points. 
Furthermore, the invention relates to image warping; the process of warping an 
1 5 image in a manner that causes the vanishing points of the image to relocate. 

BACKGROUND OF THE INVENTION 

Conventional consumer cameras (both film and digital) capture an 
image having a rectangular imaging area. For cameras using 35mm film, the 

20 horizontal dimension of the imaging area is approximately 36mm and the vertical 
dimension of the imaging area is approximately 24 mm. 

Many photographers find it very difficult to hold a camera level 
when capturing a photograph. A photographer often pivots the camera slightly 
about the optical axis (whereby the optical axis is defined as an axis passing 

25 through the center of the image capture frame and the center of the camera lens). 
These small rotations are generally unintentional and may occur because 
conventional consumer cameras are lightweight, for example, the popular Single- 
Use cameras. Rotations about the optical axis give the appearance that the subjects 
in the photograph are off-centered, leaning, or tilted. 

30 U.S. Patent No. 6,01 1,585 issued January 4, 2000 to Anderson, 

entitled Apparatus and Method for Rotating the Display Orientation of a 



Captured Image describes a method of determining image format and orientation 
from a sensor present in the camera at the time of image capture. Whereas, this 
sensor can enable one to determine the orientation or format of a captured image, it 
cannot lead to detecting a small amount of camera rotation. Instead, the sensor 
5 identifies the major image orientation (in increments of 90 degrees) by determining 
which side of an image corresponds with the "top" of the image. In addition, the 
Anderson method necessitates a sensor of this sort be present within a camera. 

Lutton et al. (in "Contribution to the Determination of Vanishing 
^ Points Using Hough Transform," IEEE Trans. Pattern Analysis and Machine 

=0 10 Intelligence, Vol. 16, No. 4, pp. 430-438, Apr. 1994) attempts to detect the 

|T§ vertical direction of an image. The Lutton et al. article teaches one to select the 

!2 s direction that is orthogonal to the most directions in the scene. The implicit 

assumption is that the scene will contain many horizontal lines. However, this is 
s not always the case. Additionally, the Lutton analysis is performed with a 

fJ5 15 possibly large number of line segments, rather than vanishing points. The Lutton 

TZ method requires a great deal of processing and may be computationally costly. 

O Some existing systems detect and correct a skewed image. These 

~~ systems are primarily in the field of document imaging. Nevertheless, documents 

have a fairly well defined structure, as opposed to more unpredictable consumer 
20 type photographs. Another sharp contrast between document imaging correction 

and photographic imaging correction is that vanishing point detection is not used in 
the field of document imaging, nor is vanishing point detection required. 

Pending U.S. Application Serial No. 09/281,574, filed March 30, 
1999 by Gallagher, entitled A Method for Modifying the Perspective of a Digital 
25 Image, shows a process for compensating from the perspective of image capture. 
However, this solution cannot lead to compensating for camera rotation about the 
optical axis. The perspective of image capture relates to tipping the camera in the 
up and down direction (i.e. about the x-axis), which is much different than rotating 
a camera about the optical axis. Therefore, pending U.S. Application 09/281,574 
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is limited to describing a method of correction exclusively unique to the problem 
of camera rotation about the x-axis. 

Consequently, a need exists for overcoming the above described 
drawbacks. More specifically, a need exists for determining slight image 
5 orientation changes. 



SUMMARY OF THE INVENTION 

The need is met according to the present invention by providing a 
method of generating an image transform for modifying a digital image, that 
10 includes the steps of: detecting a vanishing point related to the selected image; 
determining a preferable vanishing point location, and generating an image 
transform based on the vanishing point location and the preferable vanishing point 
location. 

In another embodiment, the present invention provides a method 
1 5 for detecting an amount of rotation between the vertical axes of a scene and an 
image of the scene, that includes the steps of: detecting a set of vanishing points 
related to the image; selecting a set of vanishing points corresponding to a vertical 
axis of the scene based on a predetermined criteria; and using the selected 
vanishing points to detect the rotation of the image. 
20 The present invention has an advantage of improving the method of 

correcting for small angles of camera rotation, i.e. camera tilt. 



BRIEF DESCRIPTION OF THE DRAWINGS 
The subject matter of the invention is described with reference to 
25 the embodiments shown in the drawings. 

Fig. 1 is a schematic representation of the system providing a 
method of determining the amount of rotation present in an image and a means for 
correcting the rotation; 

Fig. 2 is a schematic of the reference systems employed to describe 
30 the present invention; 



Figs. 3 A-C show several examples of images with different formats 
and orientations; 

Figs. 4A-B show the effects on an image as a result of small angle 
camera rotation; 

5 Figs. 5 A-B show an actual image that has been corrected by the 

method of this invention; and 

Fig. 6 is a diagram useful in describing the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

10 In the following description, the present invention will be described 

in the preferred embodiment as a method. Those skilled in the art will readily 
recognize that the equivalent of such a method may also be constructed as 
hardware or software within the scope of the invention. 

Fig. 1 shows a block diagram of the present invention. The 

1 5 purpose of the present invention is to estimate the amount of rotation of a camera 
relative to the scene at the time of image capture, based upon a digitized 
representation of an image. The source of the digitized image is irrelevant. The 
digitized image may be a scan of a film negative, a scan of a photograph, or an 
image captured with a digital camera. It should be well understood that in cases 

20 where the digitized image is a scan of a hardcopy image that the rotation of the 
digitized image corresponds to the rotation of the source image. That is, if a 
photograph was captured with a camera that was tilted by p degrees, then the 
corresponding scan of that photograph (or the scan of the photographic negative) 
will also be tilted by P degrees. Thus, the method described herein may be used to 

25 automatically determine the amount of camera rotation at the time of capturing the 
photograph, for example; by first digitizing the photograph and then analyzing the 
resulting scan with the method of the preferred embodiment. In this case, the 
source image is the photograph and the digital image is a result of the scan. It 
should be further understood that the source image may for example be a large 

30 resolution digital image. This source image may then be decimated to generate 
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the digital image that the method of the present embodiment operates upon. 
Again, the result from the present invention applies to both a source image and a 
digital image. Note that the preferred embodiment is described with reference to 
digital images having a dimension of 512 pixels by 768 pixels, although those 
5 skilled in the art will recognize that many image resolutions may be utilized with 
equal success. For example, the present invention is also suitable for processing a 
digital image having dimensions of 512 pixels by 512 pixels. Note also that the 
terms "digitized image", "digital image", and "image" are used synonymously 
q throughout this description. 

[if 1 0 Vanishing points are a useful feature for determining the amount of 

r=H rotation of an image because of the many vertical parallel lines associated with 

□ human construction. Despite a left to right positioning of the camera (or other 

m image capture device), the vanishing point associated with the vertical scene lines 

|L nearly always falls near the vertical axis of the image. 

y3 1 5 Fig. 2 shows a reference system employed with the present 

in invention. A focal point 2, representing the approximate location of the focal 

g point of the lens used to capture the image, is located a distance / from the image 

plane 3. The focal point 2 represents the center of a Gaussian sphere. The x-axis 
andjy-axis define the dimensionality of the image plane 3. The z-axis is also 
20 defined as the "optical axis" of the system. The x' -axis and they -axis define a 
plane that is parallel to the image plane 3. The image origin 4 is defined as the 
point of intersection of the image plane with the optical axis, and is given in 
Cartesian coordinates as (0,0,y). Generally, the image origin is assumed to be at 
the center of the distribution of pixels comprising the digital image, although this 
25 assumption may not be correct. In fact, if the camera is constrained to only pivot 
up or down (pivoting solely about the ;c-axis), the vanishing point associated with 
vertical scene lines must fall on the vertical axis of the image. For many 
photographers, a pleasing captured image results by maintaining a level camera 
during shooting the image. Attempting to keep the camera level applies whether 
30 the camera is held by hand or placed on a tripod or some other mechanical device. 
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Because it is desirable to capture an image with a camera held level (i.e., pivoting 
solely about the jc-axis), and such practice results in the vanishing point most 
closely associated with the image's vertical scene lines to fall on the vertical axis 
of the image, consequently, the vertical axis of the image is defined as a preferable 
5 vanishing point location for achieving a user desired image composition. In other 
words, the captured image is preferably represented as a level depiction of the 
captured scene with respect to the vertical lines within the scene. However, if the 
camera is rotated about the optical axis (the z-axis), then the vanishing point 
associated with vertical scene lines will not fall on the vertical axis of the image. 
10 However, the angle from the vanishing point corresponding to the vertical scene 
lines to the vertical axis of the image is equivalent to the amount that the camera 
was rotated about the optical axis. 



horizontal lines (lines orthogonal to the vertical lines). If these horizontal lines 
1 5 happen to be parallel to the image plane (orthogonal to the optical axis), then a 

vanishing point for the horizontal lines will occur at infinity on the horizontal axis 
of the image. However, depending upon the position of the photographer, it is 
very likely that the horizontal scene lines will not be parallel to the image plane. 
If this condition exists and the camera is tilted about the x-axis, then the vanishing 
20 point associated with the horizontal scene lines will not fall on either axis of the 
image plane. Thus, the vanishing point associated with horizontal lines in the 
scene may fall on the horizontal axis of the image, but because of the scene 
composition it is just as likely that the vanishing point will fall elsewhere. Hence, 
the vanishing point location corresponding to the horizontal lines in the scene is 
25 not constrained to fall near an image axis, but it is highly likely that the vanishing 
point associated with the vertical lines of the scene will fall near the vertical axis 
of the image. Conversely, if a vanishing point is located near an image axis, it is 
far more likely that this vanishing point corresponds to a set of vertical scene lines 
than a set of horizontal scene lines. 



In a human-made construction, there are also a large number of 



Fig. 2 discloses an example of expressing a vanishing point 
location. First, a vanishing point may be expressed as a location on the image 
plane. The vanishing point v may be expressed as the point at location v = (x Q , y Q , 
f). Such a representation of the vanishing point location performs well when the 
5 vanishing point is located near the image origin, however, the coordinate locations 
along the x-axis and the y-axis may grow quite large. Another aspect of the 
invention illustrated in Fig. 2 is a vanishing point representation scheme, 
commonly used in the field of vanishing point detection. In this representation, 
the vanishing point is represented with a Gaussian mapping. As such, each 
10 location v on the image plane has a unique location vq on the Gaussian sphere. 
The relationship between the vanishing point location on the image plane 
v = (x 0 ,y 0i f) and the vanishing point vector vq determined by the Gaussian 
mapping is given with the equation: 



v 




1 5 Likewise, if the vanishing point vector vq = (xq, yQ, zq) is known 

and the vanishing point location in the image plane is desired, then the following 
projection is used: 

z 

One can easily see that the vanishing point vector vq is a unit 
20 vector extending in the direction from the focal point to the vanishing point 
location in the image plane. With reference to Fig. 2, the image plane is 
positioned / distance from the optical origin of the system. Normally / distance 
represents the focal length. If the focal length is unknown, then a reasonable guess 
may be used. In one embodiment, /is the diagonal length of the imager. For 
25 example, where an image has a dimension of 512 pixels by 768 pixels,/ equals 
923. The vanishing point on the image plane may then be represented as the unit 
vector that points from the optical system's origin to the vanishing point on the 
image plane. This vector is of length one and may be described as the coordinates 
of the intersection of a Gaussian sphere (a sphere of radius 1 .0) centered at the 



optical system origin (the focal point) and the line passing through both the optical 
system origin and the vanishing point on the image plane. This vector 
representation of the vanishing point is advantageous because it contains the entire 
space of the image plane. 
5 In one embodiment of the invention as shown in Fig. 1 , a digital 

image is input to a vanishing point detector 12. The purpose of the vanishing 
point detector is to identify the locations of the vanishing points of the digital 
image. As is well known in the art, a vanishing point is the result of the 
perspective projection of the three dimensional scene onto a two dimensional 

10 image plane. A vanishing point refers to the point in the image plane (a two 

dimensional projection of the three dimensional scene) where parallel lines in the 
scene meet. Vanishing points generally only have relevance for images containing 
images of a structure containing at least two line segments, generally from man- 
made structures. Several authors have documented methods of automatically 

15 locating the vanishing points of an image. For example, S. Barnard, "Interpreting 
Perspective Images Artificial Intelligence , vol. 21, pp. 435-462, 1983. A 
preferred automated technique is disclosed in pending U.S. Provisional Patent 
Application Serial No. 60/192,195 filed March 27, 2000 by Gallagher, entitled 
Vanishing Point Detection by Training with Ground Truth Data, which is 

20 incorporated by reference. In addition, the vanishing point detector may include 
manually identifying the vanishing points using operator input. The vanishing 
point detector 12 outputs the locations of all vanishing points identified for the 
image. Characteristically, the number of vanishing points determined for a single 
image is not greater than three, although this should not be viewed as a limitation. 

25 As described below, the vanishing points determined for the image are output in 
the form of vanishing point vectors. Assuming that M vanishing points are 
determined for the digital image, the output of the vanishing point detector may be 
represented as vQm > where m ranges from 1 to M, Alternatively, it is possible 
that zero vanishing points may be determined for the digital image. In this event, 



then the present invention cannot draw any conclusions regarding image rotation 
from the location(s) of vanishing point(s). 

If M > 0, then the vanishing point vectors detected by the vanishing 
point detector 12 are input to a vanishing point selector 13. The purpose of 
5 vanishing point selector 13 is to determine those vanishing points that may be 
useful for determining the amount of rotation (i.e. the rotation of the camera from 
the level position at the time of capture) of the digital image by using the 
information contained in the M vanishing point vectors. 

The vanishing point selector 13 determines if any of the M 
1 0 vanishing points associated with the digital image may be used to determine the 
amount of rotation of the digital image. Those vanishing points which may be 
used to determine the format of the image are referred to as "rotation candidate 
vanishing points." 

In the preferred embodiment, a rotation candidate vanishing point 
1 5 is any vanishing point having an associated vanishing point vector vq = [xq, yQ, 
zq] meeting the either of the following two conditions: 

1. \x G \>Tj 

2. M > T] 

Values of Tj between 0.3 and slightly less than 1 are useful in 
20 practicing the present invention. In the preferred embodiment, Tj is set at 0.5. 

The regions in the xq, yQ plane in which such vanishing points lie are illustrated 
by cross hatching in Fig. 6. The vanishing point selector 13 outputs TV (where TV is 
between 0 and M) format candidate vanishing points. 

Alternatively, it should be well understood that those skilled in the 
25 art can easily modify the above stated rules from classifying a vanishing point 

vector vq as a rotation candidate vanishing point. For example, a vanishing point 
vector vq may be classified as a rotation candidate vanishing point if the 
following condition is met: 

V X G 2 +YG 2 >T 2 
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Values of T2 between 0.3 and slightly less than 1 are useful in 
practicing the invention. 

If an image has zero rotation candidate vanishing points (ifN~ 0) , 
then the present invention does not detect the rotation of the digital image using a 
5 feature based on vanishing point location. 

If there is at least one rotation candidate vanishing point 
corresponding to the digital image, then the rotation candidate vanishing point is 
output from the vanishing point selector 13 and input to a rotation detector 14. 
Note that other information may also be input to the rotation detector 14 in order 
10 to aid the determination process. For instance the format (an identification of the 
vertical axis of the image as will be described herein) or orientation (an 
identification of the top of the image) may also be input to the rotation detector 14 
for the purpose of determining the rotation amount. The operation of the rotation 
detector 14 is to determine the angle p between the vertical axis of the image and 
1 5 the vector pointing from the image origin to the vanishing point corresponding to 
the vertical lines in the scene. In other words, the angle P corresponds to the 
amount of rotation of the capture device (for example a camera) about the optical 
axis. 

The angle of rotation for each of the N rotation candidate vanishing 
20 points is computed by the rotation detector 14 by first projecting the vanishing 
point under consideration to the xy-plane. This is accomplished according to the 
following equation: 

[xG»yG>°] 

Gxy / 2 2 

V X G +YG 

The vector vQxy xs a vector which generally points in a direction 
25 parallel to the vertical axis of the scene. 

Next, the angles ^are determined to be the angle between the 
vector vQxy and both the positive and negative vertical axes of the image. The 
vertical axis of an image is the axis on the image plane parallel to either the x-axis 
or the y-axis which also passes through the "top \and "bottom" of the image. The 



vertical axis of the image will^be further explained herein below. If the vertical 
axis of the image is known, then the angles y are computed by taking the inverse 
cosine of the dot product of the two vectors, as is well known in the art. For 
example, if the jy-axis is the vertica\axis of the image, then the angles /may be 
5 determined as: \ 

^ = sign(x G y G )cos" 1 (v Gxy • [0,1,0]) 

/=sign(x G y G )co^" 1 (v Gxy [0,-l,0]) 

where sign(xQ yQ) represents the sign (-1 ory+1) of the product of xq and yQ. For 
example, when xq and yQ are either both negative or both positive, the sign(xQ 

1 0 yG) ~ 1 • Alternatively, when only xq or yQ is negative then sign(xQ yQ) = -1 . 

If the vertical axis of the image is not known by the rotation 
detector 14, then the angles ^are determined to each image axis (for a total of four 
angles y). Thus, for each rotation candidate vanishing point, several angles ^are 
determined as the angle between the vector vQxy representing the projection of the 

1 5 vanishing point onto the xy-plane and the image axes. The angle (3 is determined 
to be the smallest (in an absolute value sense) of these angles y. Note that the 
angle P may be positive or negative in magnitude, indicating the direction of 
camera rotation. The output of the rotation detector 14 is the angle p. Note that 
the angle p is expected to be equal to or less than 7i/4. 

20 Fig. 1 shows that the identification of the vertical axis of the digital 

image is output from the rotation detector 14 and passed to an image transform 19. 
According to an alternative embodiment of the invention, this image transform 
may also receive various other parameters, including the digital image itself. The 
operation of the image transform 19 may be any number of image transformations 

25 that benefit from knowledge of image rotation. 

In an example of the use of the rotation information, the angle p of 
the digital image may be stored as metadata by the image transform 19. Metadata 
is generally defined as data corresponding to an image apart from actual pixel 
values. 



Additionally, the function of the image transform 19 may account 
for the rotation of the image by performing an image rotation in the opposite 
direction. Image rotation is well known by those skilled in the art of image 
processing. The amount the digital image must be rotated is given by the negative 
5 of p. 

In another embodiment, the vanishing point vector is 
interchangeable with the vanishing point location given in Cartesian coordinates. 

Fig. 3 shows several image examples useful for defining terms used 
herein. Fig. 3 A shows an example image of a person. The top of the image is 

10 defined as the side of the image that was in the "up" direction from the 

photographer's perspective at the time of image capture. Identification of the top 
of an image solves the problem of orientation. In the case of Fig. 3 A, the top of 
the image clearly corresponds to the top of the subject's head. Note that the 
bottom of an image is always the image side opposite the image top. Also shown 

15 in Fig. 3 A are axes parallel to the x-axis and the >>-axis passing through the image 
origin. These axes have herein been defined as the x '-axis, and the y -axis, 
respectively, and shall herein be known collectively as image axes. 

Note that the vertical axis of the image is that line that passes 
through the top and bottom of the image, as well as the image origin and 

20 coincident with either the x '-axis or the y -axis. The vertical axis of the image 

defines the format of the image, by specifying which two sides are top and bottom. 
As used herein, the term "format" means the identity of the vertical axis of the 
image. However, the vertical axis of the image does not specify which of the two 
sides is the top, thus the orientation of an image may remain unknown even when 

25 the format is known. Note in the case of the image in Figure 3 A, the y '-axis is the 
vertical axis of the image. 

Fig. 3B shows an example where the top of the image is the right 
image side. In this example, the vertical axis of the image is the x '-axis. Finally, 
Fig. 3C shows an example image where, like the image in Fig. 3 A, the vertical 



axis of the image is they'-axis. However, in this example the top of the image lies 
on the negative^ '-axis. 

Note that if the camera is not held level at the time of image 
capture, then the image may appear to be tilted. That is, if the x-axis of the 
5 imaging system is not orthogonal to the vertical direction in the scene, then the 
image will appear to be tilted or rotated. Commonly, the amount of rotation is 
small since the photographer is making an effort to hold the camera level. For 
example, Fig. 4A shows an embodiment similar to that of Fig. 3 A, except for the 
fact that a simulated amount of camera rotation is included. Additionally, Fig. 4B 

10 discloses an embodiment to that of Fig. 3B with the addition of camera rotation. It 
is possible to show that the amount of the camera rotation (the angle from the 
level position) is the same as the angle p that the image is rotated. As described 
herein, the vanishing point locations detected from the image allow for the 
identification of this angle. 

1 5 Fig. 5 A shows an ima^e that has an apparent amount of camera 

rotation (the camera was tipped in the clockwise direction at the time of capture.) 
The method of the present invention wasVpplied to the image and estimated that 
the amount of rotation of the image in the counter-clockwise direction was 3.4 
degrees. Fig. 5B shows the corrected image\generated by rotating Fig. 5A by 3.4 

20 degrees in the clockwise direction. There is a noticeable improvement in the 

handrail orientation as a result of the processingW the present invention. In the 
case where the image transform 19 performs a rotation of the digital image, there 
may be additional logic based upon the value of (3. For instance, if |P| < 1 degree, 
performing a rotation correction may not produce a result noticeable enough to be 

25 worth the computational effort. Additionally, there ma\ be an upper limit on the 
amount of rotation that the image transform 19 will execute. For example, if |P| = 
44 degrees, it may be advantageous that nothing is done tome image, as a 
modification of this magnitude may produce displeasing results if the value of P 
produced by the algorithm was incorrect. Another aspect of the; invention has the 

30 image transform 19 performing a rotation by an amount of -P, ink resulting image 
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produced has a vanishing point on the vertical axis of the image (assuming that the 
value of p is correct). In Vhis embodiment, the image transform 19 is a rotating 
transformation which operates by rotating the image. Such a transformation is a 
warping of the image, sinceme geometry of the image output from the image 
5 transform 19 has been modified relative to the geometry of the image input to the 
image transform 19. The location of the rotation candidate vanishing point vq of 
the image input to the image transform 19 is an undesirable vanishing point 
location, because it does not lie oA the vertical axis of the image. The vertical axis 
of the image is considered to be a preferable vanishing point location. However, 

1 0 those skilled in the art will recognizeVhat other preferable vanishing point 

locations may exist. For instance, infinity (or zq =0 in vanishing point vector 
notation) may be a preferable vanishing-point location. In this case, the digital 
image may be modified by an image transform 19 designed to warp the image in 
such a manner that the image resulting from the image transform 19 has a 

15 vanishing point now located at infinity. Alternatively, another preferable 

vanishing point location may be at infinity anaValso on the vertical axis of the 
image (in this case, either xq or yQ = 1). Clearly, the vanishing point of the image 
output from the image transform 19 lies on the vertical axis of the image, and is 
therefore a preferable vanishing point location. Thus, the operation of the image 

20 transform 19 is to warp the image in such a fashion that a vanishing point 

associated with the input image migrates from an undesirable vanishing point 
location to a desirable vanishing point location in the imVge output from the image 
transform 19. Those skilled in the art of image processings will recognize that an 
image transform 19 may be created (either deterministicallAor empirically) to 

25 warp an image in such a manner as to relocate a vanishing pmnt from an 
undesirable vanishing point location within the input image to V desirable 
vanishing point location within the resulting image. \ 

Those skilled in the art will recognize that many variations may be 
made to the description of the present invention without significantly deviating 

30 from the scope of the present invention. 
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